Estimation of Window Coefficients for Dynamic Feature Extraction for HMM-Based Speech Synthesis
نویسندگان
چکیده
In standard approaches to hidden Markov model (HMM)-based speech synthesis, window coefficients for calculating dynamic features are pre-determined and fixed. This may not be optimal to capture various context-dependent dynamic characteristics in speech signals. This paper proposes a data-driven technique to estimate the window coefficients. They are optimized so as to maximize the likelihood of trajectory HMMs given data. Experimental results show that the proposed technique can achieve a comparable performance with the meanand variance-updated trajectory HMMs in the naturalness of synthesized speech, while offering significantly lower computational cost.
منابع مشابه
Speech enhancement based on hidden Markov model using sparse code shrinkage
This paper presents a new hidden Markov model-based (HMM-based) speech enhancement framework based on the independent component analysis (ICA). We propose analytical procedures for training clean speech and noise models by the Baum re-estimation algorithm and present a Maximum a posterior (MAP) estimator based on Laplace-Gaussian (for clean speech and noise respectively) combination in the HMM ...
متن کاملHMM based Automatic Speech Recognition Analysis
This project's 'HMM Based Automatic Speech Recognition Analysis main motive is just to generate an Automatic speech recognition which is clear an accurate using Hidden Markov Model (HMM) to get accurate results at number of frequency ranges related to human voice. Here is a record of 12 different words which is recorded by using a number of different speakers that includes male and female both ...
متن کاملImproved Linear Predictive Coding Method for Speech Recognition
In this paper, improved Linear Predictive Coding (LPC) coefficients of the frame are employed in the feature extraction method. In the proposed speech recognition system, the static LPC coefficients + dynamic LPC coefficients of the frame were employed as a basic feature. The framework of Linear Discriminant Analysis (LDA) is used to derive an efficient and reduced-dimension speech parametric s...
متن کاملAn HMM-Based Approach to Flexible Speech Synthesis
The increasing availability of large speech databases makes it possible to construct speech synthesis systems, which are referred to as corpusbased, data-driven, speaker-driven, or trainable approach, by applying statistical learning algorithms. These systems, which can be automatically trained, not only generate natural and high quality synthetic speech but also can reproduce voice characteris...
متن کاملAn experimental HMM-based postal OCR system
It is almost universally accepted in speech recognition that phoneor word-level segmentation prior to recognition is neither feasible nor desirable, and in the dynamic (pen-based) handwriting recognition domain the success of segmentation-free techniques points to the same conclusion. But in image-based handwriting recognition, this conclusion is far from being firmly established, and the resul...
متن کامل